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IN THE CLAIMS 

Please amend the claims as follows: 

1 . (Withdrawn) A method for processing a memory instruction, the method comprising: 
obtaining a memory instruction; 

obtaining one or more memory address operands; 

creating a virtual memory address using the one or more memory address operands; 
translating the virtual memory address into a physical memory address; and 
executing the memory instruction using a cache controller, 

wherein the cache controller uses the memory instruction and the physical memory 
address to determine whether to access a portion of a local or a remote cache. 

2. (Withdrawn) The method of claim 1 , wherein the obtaining of the memory instruction 
includes obtaining a scalar memory instruction. 

3. (Withdrawn) The method of claim 1, wherein the obtaining of the memory instruction 
includes obtaining a memory instruction selected from a group consisting of a scalar load 
instruction, a scalar store instruction, a prefetch instruction, a synchronization instruction, and an 
atomic memory operation (AMO) instruction. 

4. (Withdrawn) The method of claim 1 , wherein the obtaining of the memory instruction 
includes obtaining a scalar store instruction, and wherein the method further comprises obtaining 
scalar store data. 



5 . (Withdrawn) The method of claim 1 , wherein the translating includes translating the 
virtual memory address into a physical memory address using a translation look-aside buffer 
(TLB). 
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6. (Withdrawn) The method of claim 1 , wherein the method further comprises 
committing the memory instruction before execution. 



7. (Currently Amended) A computerized method, comprising: 
obtaining a memory request; 

storing the memory request in a first memory request container an Initial Request Queue 
(IRQ) ; and 

processing the memory request from the first memory request container IRQ by a cache 
controller, wherein the processing includes: 

identifying a type of the memory request, and 

processing the memory request in both a local cache and an Force Order Queue 
(FOO), as a function of a first condition, and wherein processing includes determining if a 
portion of an address associated with the memory request matches one or more partial addresses 
in the FOO and, if the memory request misses in the cache and the address does not match one or 
more partial addresses in the FOO, adding proces s ing the memory request using a second 
m e mory r e qu e st container aa a function of a s e cond condition, wherein th e proc e ssing of th e 
memory request using the second memory request contain e r includ e s updating a portion of fee 
local cache with a portion of a r e mote cach e to the FOO and allocating a cache line in the local 
cache corresponding to the local cache miss. 

8. (Original) The computerized method of claim 7, wherein the obtaining of the memory 
request includes obtaining a memory load or a memory store request. 



9. (Canceled) 

10. (Canceled) 



11. (Canceled) 
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12. (Currently Amended) The computerized method of claim 44 7, wherein the processing 
of the memory request using the FOQ as a function of th e s e cond condition includes processing 
the memory request using the FOQ when the memory request matches a corresponding request 
in the FOQ. 

1 3 . (Currently Amended) The computerized method of claim 44 7, wherein the processing 
of the memory request using the FOQ as a function of the second condition includes adding the 
memory request to the FOQ when the memory request is not initially processed by the local 
cache , and when the memory r e qu e st do e s not initially match a corresponding request already in 
t he FOQ 

14. (Currently Amended) The computerized method of claim 4-4- 7, wherein the processing 
of the memory request using the FOQ as a function of th e second condition includes processing 
the memory request in the FOQ when local cache processing is bypassed. 

1 5 . (Currently Amended) The computerized method of claim 44 7, wherein the processing 
of the memory request using the FOQ as a function of the second condition includes processing 
the memory request in the FOQ when the memory request includes a synchronization request 
that causes local cache processing to be bypassed. 

16. (Withdrawn) A computerized method, comprising: 

adding a set of memory instructions of a first type to a memory instruction container; 
executing the set of memory instructions of the first type from the memory instruction 
container; 

receiving a memory instruction of a second type; 

receiving a synchronization instruction that temporarily blocks further execution of 
memory instructions of the first type from the memory instruction container; 

adding an additional set of memory instructions of the first type to the memory 
instruction container; 

executing the memory instruction of the second type; and 
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upon such execution of the memory instruction of the second type, executing the 
additional set of memory instructions of the first type from the memory instruction container. 

17. (Withdrawn) The computerized method of claim 16, wherein the adding of the set of 
memory instructions of the first type to the memory instruction container includes adding the set 
of memory instructions of the first type to a memory instruction queue. 

1 8 . (Withdrawn) The computerized method of claim 1 6, wherein the adding of the set of 
memory instructions of the first type to the memory instruction container includes adding a set of 
scalar memory instructions to the memory instruction container. 

1 9. (Withdrawn) The computerized method of claim 1 6, wherein the executing of the set 
of memory instructions of the first type from the memory instruction container includes 
accessing a local or remote cache unit. 

20. (Withdrawn) The computerized method of claim 1 6, wherein the receiving of the 
memory instruction of the second type includes receiving a vector memory instruction. 

2 1 . (Withdrawn) The computerized method of claim 1 6, wherein the executing of the 
memory instruction of the second type includes accessing a remote cache unit, and updating a 
local cache unit with contents from the remote cache unit. 

22. (Currently Amended) A scalar processor, comprising: 
a cache; and 

a scalar load/storo unit having a first container, a second container, and an Initial Request 
Queue (IRQ); and 

a cache controller; 

wherein the IRO buffers a scalar load/store command having a scalar load/store 
instruction and one or more addresses and sends the scalar load/store command to the cache 
controller and the cache; 
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wherein the cach e controller obtains a scalar load/store command from the first container, 
the scalar load/store command having a scalar load/store instruction and one or more addr e sses, 

wherein the cache services the scalar load/store command received from the IRQ when 
the scalar load/store command hits in the cache; 

wherein the cache controller includes an Force Order Queue (FOQ), wherein the scalar 
load/store command is added to the second container FOQ when the scalar load/store command 
misses in the cache; and 

wherein one or more lines in the cache are allocated by a remote cach e for cache line 
replacement when the scalar load/store command is added to the second container FOQ and the 
address for the cache line does not match a partial address in the FOQ . 



23. (Canceled) 



24. (Original) The scalar processor of claim 22, wherein the scalar load/store unit further 
includes an address generator to generate one or more physical addresses from the one or more 
addresses of the scalar load/store command. 



25. (Original) The scalar processor of claim 24, wherein the address generator generates 
the one or more physical addresses using a translation look-aside buffer (TLB). 

26. (Withdrawn) A processing unit, comprising: 
a remote cache interface; and 

a scalar processor having a local cache, 

wherein the scalar processor dispatches a memory instruction, obtains one or more 
address operands, creates a virtual address from the one or more address operands, translates the 
virtual address into a physical address using a translation look-aside buffer (TLB), and executes 
the memory instruction using a cache controller, the cache controller determining whether to 
obtain data from the local cache or to allocate one or more lines of the local cache through the 
remote cache interface. 
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27. (Withdrawn) A processing unit, comprising: 
a cache interface; 
a vector processor; and 
a scalar processor, 

wherein the scalar processor processes a plurality of scalar memory instructions in a 
memory instruction container, 

wherein the vector processor receives a vector memory instruction, 

wherein the scalar processor receives a synchronization instruction and temporarily 
blocks further processing of scalar memory instructions in the memory instruction container, 

wherein the vector processor processes the vector memory instruction by accessing a 
cache through the cache interface, and 

wherein the scalar processor continues to process further scalar memory instructions in 
the memory instruction container after the vector processor has processed the vector memory 
instruction. 

28. (Withdrawn) The processing unit of claim 27, wherein the memory instruction 
container comprises a memory instruction queue. 

29. (New) A scalar processor, comprising: 
a cache; 

an Initial Request Queue (IRQ); and 
a plurality of cache controllers; 

wherein the IRQ buffers a scalar load/store command having a scalar load/store 
instruction and one or more addresses and sends the scalar load/store command to one of the 
cache and to the cache controllers; 

wherein the cache services the scalar load/store command received from the IRQ when 
the scalar load/store command hits in the cache; 

wherein one or more of the plurality of cache controllers includes an Force Order Queue 
(FOQ), wherein the scalar load/store command is added to the FOQ when the scalar load/store 
command misses in the cache, and 
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wherein one or more lines in the cache are allocated for cache line replacement when the 
scalar load/store command is added to the FOQ and the address for the cache line does not match 
a partial address in the FOQ. 

30. (New) The scalar processor of claim 29, wherein the cache controller with the 
FOQ includes a FOQ index array. 

3 1 . (New) The scalar processor of claim 29, wherein the FOQ is divided logically 
into a first and second queue, wherein the first queue manages requests to memory and the 
second queue manages data to be stored to cache. 

32. (New) The scalar processor of claim 22, wherein the FOQ is divided logically 
into a first and second queue, wherein the first queue manages requests to memory and the 
second queue manages data to be stored to cache. 



